AITopics | dpr model


Decoding Dense Embeddings: Sparse Autoencoders for Interpreting and Discretizing Dense Retrieval

Park, Seongwan, Kim, Taeklim, Ko, Youngjoong

arXiv.org Artificial Intelligence, Aug-28-2025

Despite their strong performance, Dense Passage Retrieval (DPR) models suffer from a lack of interpretability. In this work, we propose a novel interpretability framework that leverages Sparse Autoencoders (SAEs) to decompose previously uninterpretable dense embeddings from DPR models into distinct, interpretable latent concepts. We generate natural language descriptions for each latent concept, enabling human interpretations of both the dense embeddings and the query-document similarity scores of DPR models. We further introduce Concept-Level Sparse Retrieval (CL-SR), a retrieval framework that directly utilizes the extracted latent concepts as indexing units. CL-SR effectively combines the semantic expressiveness of dense embeddings with the transparency and efficiency of sparse representations. We show that CL-SR achieves high index-space and computational efficiency while maintaining robust performance across vocabulary and semantic mismatches.
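As a rough illustration of the idea of using latent concepts as indexing units, the sketch below builds an inverted index over sparse concept activations and scores a query by summing weight products over shared concepts. It is not the authors' implementation: the activation vectors are hand-written toy values rather than the output of a trained sparse autoencoder, and the helper names `top_k_sparsify`, `build_inverted_index`, and `search` are hypothetical.

```python
from collections import defaultdict


def top_k_sparsify(activations, k):
    """Keep the k largest positive activations as {concept_id: weight}."""
    ranked = sorted(enumerate(activations), key=lambda p: p[1], reverse=True)
    return {i: a for i, a in ranked[:k] if a > 0}


def build_inverted_index(doc_concepts):
    """Map each concept id to a posting list of (doc_id, weight) pairs."""
    index = defaultdict(list)
    for doc_id, concepts in doc_concepts.items():
        for concept_id, weight in concepts.items():
            index[concept_id].append((doc_id, weight))
    return index


def search(index, query_concepts):
    """Score documents by summing query/document weight products per shared concept."""
    scores = defaultdict(float)
    for concept_id, q_weight in query_concepts.items():
        for doc_id, d_weight in index.get(concept_id, []):
            scores[doc_id] += q_weight * d_weight
    return sorted(scores.items(), key=lambda p: p[1], reverse=True)


# Toy activations over 6 latent concepts (assumed values, not real SAE output).
docs = {
    "d1": top_k_sparsify([0.0, 2.0, 0.0, 1.0, 0.0, 0.5], k=2),
    "d2": top_k_sparsify([1.5, 0.0, 0.0, 0.0, 3.0, 0.0], k=2),
}
query = top_k_sparsify([0.0, 1.0, 0.0, 0.0, 0.5, 0.0], k=2)

index = build_inverted_index(docs)
ranking = search(index, query)
print(ranking)
```

Only documents that share at least one active concept with the query are touched during scoring, which is where a sparse index of this kind would get its efficiency.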

latent concept, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2506.00041

Country: North America > United States (0.46)

Genre: Research Report > New Finding (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)


Control Token with Dense Passage Retrieval

Lee, Juhwan, Kim, Jisu

arXiv.org Artificial Intelligence, May-13-2024

This study addresses the hallucination problem in large language models (LLMs). We adopted Retrieval-Augmented Generation (RAG) (Lewis et al., 2020), a technique that involves embedding relevant information in the prompt to obtain accurate answers. However, RAG also faced inherent issues in retrieving correct information. To address this, we employed the Dense Passage Retrieval (DPR) (Karpukhin et al., 2020) model for fetching domain-specific documents related to user queries. Despite this, the DPR model still lacked accuracy in document retrieval. We enhanced the DPR model by incorporating control tokens, achieving significantly superior performance over the standard DPR model, with a 13% improvement in Top-1 accuracy and a 4% improvement in Top-20 accuracy.
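For readers unfamiliar with the Top-k accuracy metric quoted above, the sketch below shows one common way to compute it: a query counts as a hit when its gold passage appears among the k passages with the highest inner-product score. The vectors, passage ids, and helper names are made up for illustration, and the sketch does not model the control tokens, whose mechanics the abstract does not describe.

```python
def dot(u, v):
    """Inner product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def retrieve(query_vec, passage_vecs, k):
    """Return the ids of the k passages with the highest inner-product score."""
    ranked = sorted(
        passage_vecs,
        key=lambda pid: dot(query_vec, passage_vecs[pid]),
        reverse=True,
    )
    return ranked[:k]


def top_k_accuracy(queries, passage_vecs, k):
    """Fraction of queries whose gold passage is among the top-k retrieved."""
    hits = sum(
        1 for q_vec, gold in queries if gold in retrieve(q_vec, passage_vecs, k)
    )
    return hits / len(queries)


# Toy 2-d embeddings (assumed values standing in for encoder outputs).
passages = {"p1": [1.0, 0.0], "p2": [0.0, 1.0], "p3": [0.7, 0.7]}
# Each query is (embedding, id of the gold passage).
queries = [([1.0, 0.1], "p1"), ([0.2, 1.0], "p3"), ([0.9, 0.8], "p3")]

print(top_k_accuracy(queries, passages, k=1))
print(top_k_accuracy(queries, passages, k=2))
```

In this toy data the second query ranks its gold passage second, so it counts as a miss at k=1 and a hit at k=2.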

dataset, dense passage retrieval, dpr model, (11 more...)

arXiv.org Artificial Intelligence

2405.13008

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.96)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.70)


Confidence-Calibrated Ensemble Dense Phrase Retrieval

Yang, William, Bergam, Noah, Jain, Arnav, Sheikhoslami, Nima

arXiv.org Artificial Intelligence, Jun-28-2023

The passage retrieval problem, which is of central importance in search engine optimization and text analytics, entails the following: given a set of documents and a query, determine which document best [...] The principal limitation to this approach is its dependence on explicit term matches between the query and the context. In many cases, the correct context-query pair may have no words in common.
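The dependence on explicit term matches can be seen with a deliberately naive term-overlap scorer. This is a minimal sketch with invented example texts; the whitespace tokenizer and the `term_overlap` function are illustrative assumptions, not the method used in the paper.

```python
def tokenize(text):
    """Lowercase and split on whitespace; returns the set of distinct terms."""
    return set(text.lower().split())


def term_overlap(query, doc):
    """Number of distinct terms shared by the query and the document."""
    return len(tokenize(query) & tokenize(doc))


query = "who wrote hamlet"
relevant = "shakespeare authored the tragedy of the danish prince"
irrelevant = "the hamlet is a small village who nobody visits"

print(term_overlap(query, relevant))
print(term_overlap(query, irrelevant))
```

The relevant passage shares no words with the query and scores zero, while the irrelevant one matches on "who" and "hamlet".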

information retrieval, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

2306.15917

Country:

Oceania > Australia > Victoria > Melbourne (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)

Genre: Research Report > New Finding (0.69)

Industry:

Law (0.49)
Government (0.30)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Information Management (0.87)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.35)
